Testing for Simultaneous Pairwise Marginal Independence
نویسندگان
چکیده
1 Introduction What types of cars do you own? What are your sources of veterinary information? For what criminal offenses have you been arrested? These are all example questions appearing on surveys where the respondent is prompted to pick any number of responses from a set of variables that summarize this type of survey data have been called multiple-response (or pick any/c) categorical variables. The survey data arising from questions of this type present a unique challenge for analysis because of the dependence among responses provided by individual subjects. For categorical variables in general, testing for independence between variables is often of interest. When at least one of the categorical variables can have multiple responses, traditional Pearson chi-square tests for independence should not be used because of the within-subject dependence among responses. Furthermore, a special kind of independence, called marginal independence, becomes of interest in the presence of multiple-response categorical variables. The purpose of this paper is to develop new approaches to the testing of marginal independence in an r×c contingency table with two multiple-response categorical variables. Agresti and Liu (1999) call this a test for simultaneous pairwise marginal independence (SPMI). an example where SPMI can be tested. The data comes from a survey conducted by the Department of Animal Sciences at Kansas State University. Two questions in the survey asked Kansas farmers about their sources of veterinary information and their swine waste storage methods. For each of these two questions, the farmers were permitted to select as many responses as applied from a list of items. The data are summarized as a 4×5 contingency table in Table 1. For example, 34 farmers picked professional consultant as a source of veterinary information and lagoon as a waste storage method. A test for SPMI determines if each source of veterinary information is simultaneously independent of each swine waste storage method. More specifically, 4 * 5=20 different 2×2 tables can be formed marginally summarizing all possible responses to item (category) pairs. Table 2 shows the 2×2 table for professional consultant and lagoon. Independence is tested in each of the 20 2×2 tables simultaneously for a test of SPMI. The test is marginal because responses are summed over the other item choices for each of the multiple-response categorical variables. This test may be of interest to agricultural companies who would like to use the information for marketing purposes. Tests for marginal independence have …
منابع مشابه
Learning from Pairwise Marginal Independencies
We consider graphs that represent pairwise marginal independencies amongst a set of variables (for instance, the zero entries of a covariance matrix for normal data). We characterize the directed acyclic graphs (DAGs) that faithfully explain a given set of independencies, and derive algorithms to efficiently enumerate such structures. Our results map out the space of faithful causal models for ...
متن کاملA SINful Approach to Model Selection for Gaussian Concentration Graphs
A multivariate Gaussian graphical Markov model for an undirected graph G, also called a covariance selection model or concentration graph model, is defined in terms of the Markov properties, i.e., conditional independences associated with G, which in turn are equivalent to specified zeroes among the set of pairwise partial correlation coefficients. By means of Fisher’s z-transformation and Šidá...
متن کاملA Univariate Marginal Approach for Pairwise Testing of Software Product Lines
Software Product Line (SPL) is a software engineering paradigm that is inspired by the concept of reusability of common features, formulated for different software products. Complete testing of all software products in SPL is known to be unfeasible. This is due to the very large number of possible products that can be produced or configured using a combination of features in the SPL. Pairwise T...
متن کاملTwo-stage strategies to detect gene × gene interactions in case-control data
Large genetic association studies based on hundreds of thousands of single-nucleotide polymorphisms (SNPs) are a popular option for the study of complex diseases. The evaluation of gene x gene interactions in such studies is a sensible method of capturing important genetic effects. The number of tests required to consider all pairs of SNPs, however, can lead to a computational burden, and effic...
متن کاملIncorporating spatial dependence in regional frequency analysis
The efficiency of regional frequency analysis (RFA) is undermined by intersite dependence, which is usually ignored in parameter estimation. We propose a spatial index flood model where marginal generalized extreme value distributions are joined by an extreme-value copula characterized by a max-stable process for the spatial dependence. The parameters are estimated with a pairwise likelihood co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002